UNDER REVIEW IN IEEE TRANSACTIONS ON INFORMATION THEORY 



1 



Linear Coherent Estimation with Spatial 

Collaboration 

Swarnendu Kar, Student Member, IEEE, and Pramod K. Varshney, Fellow, IEEE 



o 

C 

OS 



C/5 

O 



> 

m 
o 
p 

o 

(N 

> 
X 



Abstract — A power constrained sensor network that consists 
of multiple sensor nodes and a fusion center (FC) is considered, 
where the goal is to estimate a random parameter of interest. 
In contrast to the distributed framework, the sensor nodes may 
be partially connected, where individual nodes can update their 
observations by (linearly) combining observations from other 
adjacent nodes. The updated observations are communicated 
to the FC by transmitting through a coherent multiple access 
channel. The optimal collaborative strategy is obtained by 
minimizing the expected mean-square-error subject to power 
constraints at the sensor nodes. Each sensor can utilize its 
available power for both collaboration with other nodes and 
transmission to the FC. Two kinds of constraints, namely the 
cumulative and individual power constraints are considered. 
The effects due to imperfect information about observation and 
channel gains are also investigated. The resulting performance 
improvement is illustrated analytically through the example of a 
homogeneous network with equicorrelated parameters. Assuming 
random geometric graph topology for collaboration, numerical 
results demonstrate a significant reduction in distortion even for 
a moderately connected network, particularly in the low local- 
SNR regime. 

Index Terms — Distributed Estimation, Wireless Sensor Net- 
works, LMMSE Estimators, Constrained Optimization 



I. Introduction 

Wireless sensor networks consist of spatially distributed 
battery-powered sensors that monitor certain environmental 
conditions and often cooperate to perform specific signal pro- 
cessing tasks such as detection, estimation and classification 
(TJ. In this paper, we consider a network that is deployed for 
the purpose of estimating a common random parameter of 
interest. After observing noisy versions of the parameter, the 
sensors can share their observations among other neighboring 
nodes, an act referred to as collaboration in this paper (fol- 
lowing J21)- The observations from the neighbors are linearly 
combined using appropriate weights and then transmitted to 
the fusion center (FC) through a coherent multiple access 
channel (MAC). The FC receives the noise-corrupted signal 
and makes the final inference. The schematic diagram of such 
a system is shown in Figure [TJ (we will introduce the notations 
and describe each block later in Section p} . 
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Fig. 1. Sensor network performing collaborative estimation. 



The individual sensor nodes are battery powered and hence 
the network is power limited. The power constraints can be 
described by the following two situations 1) Cumulative: Here 
the total power-usage in the network (summed across all the 
nodes) has to be below a pre-specified limit. 2) Individual: 
Here each node has their own power constraint as dictated by 
the capacity of their batteries. The performance analysis of 
a cumulative-constraint problem is usually simpler and more 
insightful since we have only one constraint to take care of. 
The individual-constraint problem is more practical from an 
implementation perspective but also more difficult to analyze. 
In the absence of a power limit, the sensors could collaborate 
with all the other nodes, make the inference in the network, 
and transmit the estimated parameter to the FC without any 
further distortion (by using infinite transmission power). This 
is similar to the centralized inference situation, where the 
error in estimate is only due to the noisy observation process. 
However, with limited power availability, both collaboration 
and transmission have to be performed judiciously, so as to 
maximize the quality of inference at the FC. In this paper, 
we study the following problems. For a cumulative power 
constraint, we study the optimal allocation of power resources 
among various nodes and tasks (namely collaboration and 
transmission) so as to achieve the best estimation performance 
at the FC. Regarding individual power constraints, the goal is 
to allocate power between collaboration and transmission at 
each node. 

In the absence of collaboration, this problem reduces to 
the class of distributed inference/beamforming algorithms. In 
distributed estimation, the objective is to coordinate all the 
sensors so that without communicating with one-another, they 
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collectively maximize the quality of inference at the FC. The 
quality of inference can be quantified by either the mean- 
square error (MSE) (in case of random signals) or Cramer-Rao 
lower bound (in case of deterministic unknown signals). Dis- 
tributed estimation has been extensively researched both from 
analog |3],[4|,|5| and digital [6|,[7|,[8| encoding perspectives. 
Examples of analog encoding include the amplify-and-forward 
(AF) scheme, where the nodes amplify the raw observations 
and transmit to the FC by either forming a coherent beam 
in a multiple access channel or using their own dedicated 
links with the FC for transmission (sometimes referred to as 
orthogonal MAC) J4J. The AF framework appears extensively 
in the literature [3'|,[5|,[5) due to its simplicity of implementa- 
tion in complex networks and provable information-theoretic - 
optimality properties for simple networks ifTUll . In another 
research direction, quantization of the observations may be 
performed prior to transmission lfTT1 .ll6l. The quantized ob- 
servations are then communicated to the FC using digital 
communication, where further information may be lost due to 
channel errors fl2l . Another closely related field of research 
in communication theory is distributed beamforming in relay 
networks [13j, fl4l . lfT31l . where the objective is to maximize 
the signal-to-noise ratio (SNR), rather than minimize the 
estimation error, at the FC. These two problems are sometimes 
related, as one might imagine. When the observation and chan- 
nel gains are perfectly known, the SNR and MSE functions 
are monotonically related and the two problems are equivalent. 
However, in the presence of observation and channel gain 
uncertainties, the SNR and MSE functions are different. 

Though distributed inference/beamforming has been widely 
studied, research regarding collaborative estimation is rela- 
tively nascent. When the transmission channels are orthogonal 
and cost-free collaboration is possible within a fully connected 
sensor network, the optimal strategy is to perform the inference 
in the network and use the best available channel to transmit 
the estimated parameter 0. In a rate-constrained framework 
considered in |16|, it was shown that spatial collaboration 
can be used to whiten the observation space, thereby enabling 
efficient resource allocation when the noise is correlated. In 
a preliminary version of this paper ifTTl . we have considered 
an extension of the AF framework, where sensors are able 
to linearly combine the observations from neighboring nodes 
before transmitting to the FC. We obtained the optimal cu- 
mulative power-distortion tradeoff when a fixed but otherwise 
cost-free collaborative topology is used to transmit over a 
coherent MAC channel. In this paper, we extend the problem 
formulation in three new directions, namely a) consideration 
of individual power constraints, b) consideration of imperfect 
information about observation and channel gains (the second 
order statistics are assumed to be known), and c) consideration 
of finite costs associated with collaboration. The primary 
contributions of this paper are as follows 

• Extending the amplify-and-forward framework to formu- 
late and analyze the problem of estimation with spatial 
collaboration 

• Defining a metric called collaboration gain, that quanti- 
fies the worthiness of spatial collaboration as a tool to 



enhance the estimation performance 

• Demonstrating that for a fixed but otherwise cost-free 
(ideal) collaborative topology, the resulting optimiza- 
tion problem reduces to an eigen-decomposition problem 
for the cumulative-constraint case. For the individual- 
constraint case, accurate numerical solution can be ob- 
tained by solving several semi-definite feasibility prob- 
lems. We investigate both the cases further by deriv- 
ing/analyzing the optimal achievable distortion and the 
corresponding weights for some special collaborative 
topologies like the distributed (no-connections), partially 
connected cycles and fully connected cases. We also 
derive the explicit expression of collaboration gain for 
a homogeneous network with identical channel and ob- 
servation gains and equicorrelated observation noise. In 
particular, we demonstrate that collaboration is particu- 
larly effective in a certain power regime that depends on 
various factors like the skewness (or variability) of power- 
availability in the network, uncertainty in observation and 
channel gains, and the correlation of the measurement 
noise. 

• Addressing the design of collaborative topologies where 
finite costs are involved in collaboration. We suggest an 
efficient algorithm that uses the results for the fixed- 
topology but cost-free case to find locally optimal so- 
lutions for the finite-cost case. 

The rest of the paper is organized as follows. In Section 
|m we formulate the problem after describing each block 
of the system in Figure [T] We define "collaboration gain" 
(CG) that is normalized with respect to the operating region 
(CG 6 [0, 1]). It summarizes the efficacy of collaboration 
across various problem conditions. In Section [Til] we solve 
the optimal transmission-power allocation problem for a fixed 
but otherwise cost-free collaborative topology, (i.e., extend the 
results of [17| to address points (a) and (b) in the previous 
paragraph). We also derive explicit expressions for collabo- 
ration gain for a homogeneous network with equicorrelated 
noise. In Section |IV| we address the problem with finite 
collaboration costs and suggest a greedy algorithm to obtain 
a locally optimal solution in polynomial time. Concluding 
remarks are presented in Section [V] 

II. Problem Formulation 
A. Linear Sensing Model 

We consider the scenario where the parameter of interest is 
a scalar random variable with known statistics, specifically, 
Gaussian distributed with zero mean and variance rj 2 . The 
observations at the sensor nodes n = 1, 2, . . . , N are governed 
by the linear model x n = h n 8 + e„, where h n is the 
observation gain and e n is the measurement noise. The second 
order statistics of the observation gain h = [hi, . . . , hfi\ T 
is assumed to be 

Eh = h, varh = £ h . (1) 

The measurement noise e = [ei, £2, ■ ■ ■ , £n] T is assumed to 
be zero-mean, Gaussian with (spatial) covariance vare = S. 
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Perfect knowledge of the observation model statistics h, S/, 
and X is assumed. In vector notation, we have 



where x = [x\, X2, 



x = h9 + e, (2) 
, xn] t denotes the observations. 



B. Linear Spatial Collaboration 



We consider an extension of the analog amplify-and-forward 
scheme as our encoding and modulation framework for com- 
munication to the fusion center. In the basic amplify-and- 
forward scheme, each node transmits a weighted version of its 
own observation, say W n x n , with resulting power W^E[x^]. 
Such a scheme is appealing and often-used [4|,[3|,|2] due 
to two reasons, 1) Uncoded nature: Does not require block 
coding across time and hence efficient for low-latency systems, 
2) Optimal in select cases: For a memoryless Gaussian source 
transmitted through an additive white Gaussian noise (AWGN) 
channel (Figure [T] with N — 1), an amplify-and-forward 
scheme helps achieve the optimal power-distortion tradeoff in 
an information-theoretic sense (see Example 2.2 in [18]). The 
optimality of linear coding has also been established [19| for 
distributed estimation over a coherent MAC (Figure [TJ without 
spatial collaboration). 

In general, all the N data-collecting sensor nodes in a net- 
work may not have the ability to communicate with the FC. In 
that case, they may still pass their information (through the act 
of collaboration) to another node which has a communication 
link with the FC (see the network in Figure [3] for example). 
We assume that the nodes are ordered in such a way that the 
first M nodes (where M < N) are able to communicate with 
the FC. Let the availability of collaborative links among the 
various nodes be represented by the AIxN zero-one adjacency 
matrix (not necessarily symmetric) A, where A mn £ {0, 1}. 
An entry A mn = 1 signifies that node n shares its observation 
with node m. Sharing of this observation is assumed to be 
realized through a reliable communication link that consumes 
power C mn , regardless of the actual value of observation. The 
MxN matrix C describes all the costs of collaboration among 
various sensors and is assumed to be known. Since each node 
is trivially connected to itself, A mm = 1 and C mm = 0. We 
denote the set of all A-sparse matrices as 



{W e 



<,MxN 



W mn = if A mn = 0}. (3) 



Corresponding to an adjacency matrix A and an A-sparse ma- 
trix W, we define collaboration in the network as individual 
nodes being able to linearly combine local observations from 
other collaborating nodes, 



=1,...JV 



1,- 



,M. 



(4) 



A mn = l 

In effect, the network is able to achieve a one-shot spatial 



transformation W : x — » z of the forrrQ 
z = Wx, W € S A - 



(5) 



We refer to W as the matrix containing collaboration weights. 
It may be noted that, 1) Particularization: When W is a 
diagonal matrix (equivalently, A is the identity matrix I m), 
our collaborative scheme simplifies to the basic amplify-and- 
forward relay strategy as in [5], [3], 2) Collaboration cost: Any 
collaboration involving W € Sa is achieved at the expense 
of power 



n=l 



at node m, and cumulatively 



M 



(6) 



(7) 



for the entire network, and 3) Transmission cost: The power 
required for transmission of encoded message z m at node m 
is, 

P W ,m = ^ehe W] = [Ty.ExW T l , where (8) 

= E e,h, £ i xxT ] = S + V 2 (hh T + S h ). (9) 

Consequently, the cumulative transmission power in the net- 
work is 



M 



Pw — Pw,m — Tr 



WE X W J 



(10) 



C. Coherent Multiple Access Channel 

The transformed observations z are assumed to be transmit- 
ted to the fusion center through a coherent MAC channel. In 
practice, a coherent MAC channel can be realized through 
transmit beamforming [20], where sensor nodes simultane- 
ously transmit a common message (in our case, all z m -s 
are scaled versions of a common 6) and the phases of their 
transmissions are controlled so that the signals constructively 
combine at the FC. Denote the channel gain at node m by g m . 
The second order statistics of the channel g = [51,52, ... , c/m] 
is assumed to be, 



Eg = g, vart/ = £ g , 



(11) 



and the noise of the coherent MAC channel u is assumed to 
be a zero-mean AWGN with variance £ 2 . Perfect knowledge of 
the channel statistics g, S g and £ 2 is assumed. The output of 
the coherent MAC channel (or the input to the fusion center) 



It is worth emphasizing our assumption that though collaboration incurs 
a fixed cost (in terms of power consumed that could otherwise have been 
used for transmission), it is otherwise reliable, in the sense that the act of 
collaboration does not incur any errors. This can be implemented by, say 
communicating in a digital framework with sufficient precision and ensuring 
sufficient channel coding to counter the channel noise. We abstract this process 
by assigning a cost to the link when it is required to be used. An interesting 
problem which is worthy of research but beyond the scope of this paper, is 
to assume possibly erroneous collaboration, where z = Wx + (J (say) and 
errors incurred during collaboration (£) decrease with collaboration power. 
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IS 



y = g Wx + u, u~Af(Q,£ 2 
= g T WhO+ g T We + u . 



(12a) 
(12b) 



net gain 



net zero-mean noise 



D. Linear Minimum Mean Square Estimation 

Having received y, the goal of the fusion center is to obtain 
an accurate estimate 9 of the original random parameter 9. We 
restrict our attention to linear estimators of the form 9 = ay, 
where a is a fixed constant subject to design. We consider the 
mean square error (MSE) as the distortion metric 

u [{6-ay) 2 -W]. (13) 



V w (a) 



From the theory of linear minimum mean square estimation 
(LMMSE, see ED, Chapter 12), we readily obtain that 



olmmse = argminXV(a) 



and 



D w — V w (a LMM s E ) — rj — 



E[y 2 } ' 
W}) 2 



K[y 2 



(14a) 
(14b) 



where the above expectations are w.r.t. all random variables 
{9, h, e, g, u}. From ( |12a| i and ( |12b| i, we obtain 



E[y 2 ] 
E[y6] 



Tr E g WE x W T 
rfg T Wh, where 



and 



-En =E 



gg 



gg 



(15) 
(16) 



where E x is defined in i9). 

Remark: Perfect observation gain and channel state infor- 
mation (OGI and CSI): When the observation and channel 
gains are precisely known, i.e., Sh = S g — 0, ( |12b| i reduces 
to a linear Gaussian model conditioned on 9, 



8 ~ Af{g T Wh9, g T WT,W T g + £ 2 



(17) 



and hence the LMMSE estimator is also the minimum mean 
square estimator (MMSE) ll2D . 

#LMMSE := OLMMSE2/ = E0,e,u[#|y] =: #MMSE, (18) 

i.e., 6>lmmse minimizes the distortion over all possible estima- 
tors (not just within the linear class). 

We know from the theory of MMSE estimation (see Ell . 
[22 1) that the optimal distortion Z?mmse is related to the Fisher 
Information (FI), 

d 2 9 



J 



'< v \e 



d9 2 



p{y\o) 



(19) 



by -Dmmse > y^ji + Jj in general and that equality holds 
for linear Gaussian models of the form ( [17) . Though the FI 
(Equation ( fT9] >) results in 

.2 



J = 



i T Wh) 



gTWZW 1 g + i 2 



(20) 



for the case of perfect OGI and perfect CSI (the linear 
Gaussian model in (fT7j)), the FI is difficult to derive for cases 
when the observation and channel gains are uncertain. In fact, 



this is the main reason why we consider LMMSE estimation 
(which is suboptimal in general but easier to compute) rather 
than MMSE estimation (which is optimal but difficult to 
compute). 

For the purposes of notation in this paper, for all cases 
(whether we have perfect OGI/CSI or not), we would find 
it convenient to work with the quantity 

Jw = ^--\- (21) 

as a surrogate for the LMMSE distortion Dw (as in ( |14b) ). 
Note that D w and Jw are mono tonic ally related and that 
minimizing Dw is equivalent to maximizing Jw- Motivated 
by the preceding discussions, we would refer to Jw as the 
equivalent Fisher Information, or sometimes simply FI or even 
distortion, for the sake of brevity. 

E. Problem Statement 

The design of the collaboration weights W is critical 
since it affects both the power requirements and estimation 
performance of the entire application. Specifically, the fol- 
lowing quantities depend on W, 1) the resources required to 
collaborate, i.e., Qnz(wo,rnfl ^ or individual nodes or Q n z(w) 
cumulatively for the network (see (|6]l and |7]i), 2) the resources 
required to transmit, i.e., Pw.m for individual nodes or Pw 
cumulatively for the network (see <|8j and (fTO)), and 3) the 
final distortion of the estimate at the FC, Dw, provided by 
( |14b) . In this paper, we address the problems of designing the 
collaboration matrix that minimizes the distortion subject to 
either 1) a system- wide cumulative power constraint, 



D w 
subject to Pw - 



minimize 
w 



nz(W) 



< P L 



(22) 



or 2) power constraints at individual sensor nodes, 



minimize D 
w 



w 



subject to P W ,m + Quz(W) 



<P£,m= 1, 



,M. 



(23) 

We note that problem ( |23] ) (with M individual power con- 
straints) is more realistic from a deployment point of view, 
since various nodes in a network can possess significantly 
different power sources, based on age of deployment or 
make/type of the batteries. However, problem ( f23j ) is signifi- 
cantly more difficult than problem ([22 1, which has only one 
cumulative power constraint. Problem (22| is more important 
from a system design and analysis point of view, since it is 
more tractable analytically and as a result, reveals significant 
insights on the various system level tradeoffs. 

F. Solution Methodology 

Both problems ( |22] i and ( |23] l, in general, have no known 
procedure that efficiently computes (in polynomial-time) the 

2 Definition of operators nz(-), zero(-), and nnz(-): The operator nz : 
jjiVxiV _^ {o, \y NxN is used to specify the non-zero elements of a matrix. 
If Wij + 0, then [nz(VK)] ij = 1, else [nz(W)] i:j = 0. Similarly, the 
operator zero : R NxN — > {0, iy NxN i s used to specify the zero elements of 
a matrix, [zero(W)] y = 1- [nz(W)] i .. The operator nnz : R NxN -> Z + 
is used to specify the number of non-zero elements of a matrix. 
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globally optimal solution(s). However, for the special case 
when the entries of the collaboration cost matrix C are either 
zero or infinity, Cm € {0, oo}, we will show that globally 
optimal solutions for both the problems can be obtained using 
efficient numerical techniques, and for problem \22) , even a 
closed-form solution can be derived. 

Physically, this special case corresponds to the situation 
when the topology of a network is fixed (and hence not subject 
to design) and communication among neighbors is relatively 
inexpensive compared to communication with the FC. Let 
A = zero(C) denote the permitted adjacency matrix for 
such a situation. Hence, the collaboration costs vanish, and 
problems (|22|i and (|23]l simplify to 



minimize 

wes A 



D 



w 



minimize 

wes A 



subject to P w < P L 
D w 



and 



Subject to Pw,m < Pmi m — !)•••) M. 



(24) 



(25) 



respectively, which are optimization problems in nnz(A) 
variables. Since problems ( p4] > and ( |2~5j ), which will be solved 
in Section [III] arise out of the assumption of zero-cost for 
collaboration, we would refer to them as ideal-collaborative 
problems. 

For the general case, when the topology is flexible and col- 
laboration incurs a finite cost, a polynomial-time sub-optimal 



algorithm is proposed in Section IV where the bigger problem 
is broken into smaller sub-problems, where several ideal- 
collaborative problems of the form |24) or |25) are solved 
at each iteration. Specifically, we start from the distributed 
topology A = [Jjvf|0], and follow a greedy algorithm to 
augment the collaborative topology with the most power- 
efficient link at each iteration. 

G. Performance Metric - Collaboration Gain 

Let the optimal solutions to problems ( |22| and ( |2"3j) be 
denoted by D op t(P G ), where (note bold notation) P u = 
P c for a cumulative-constraint (problem ( |22] i) and P c = 
[Pp, P$, Pm\ T for individual constraints (problem (|23|)). 
Generally, the distortion £>opt(-P C ) depends on a number of 
problem conditions other than P c , which includes 1) the 
source variance rj 2 , 2) the noise variance X, 3) the observation 
gain statistics {h, Eh}, 4) the coherent channel gain statistics 
{g, S g }, and 5) the power needed to collaborate (matrix 
C). However, to assess and compare the benefits of spatial 
collaboration for a wide-range of problem conditions, we seek 
a metric that is normalized with respect to the operational 
region. Towards that goal, we define a few quantities. 1) Let 



D ±D, 



opt 



oo; A = 11 J 



(26) 



denote the optimal distortion that can be obtained with ar- 
bitrary collaboration and without any power constraints (the 
actual value of Do will be derived later, see (j5TJ for an early 
preview). 2) Let 



D, 



dist/ pC\ a n 

opt {-^ ) — U ' 



opt 



P C : A = [I M \0] 



(27) 



denote the optimal distortion for the distributed scenario, i.e., 
transmission power is optimally allocated among sensors and 
there is no collaboration among them. 3) Also let 



D$ n (P C ) ^ #opt 



P ; A = 11 J 



(28) 



denote the optimal distortion for the fully connected collabo- 
rative topology. Note that 



< £>& nn (P c 



< D. 



Infinite power 
Full collab. 



Finite power 
Full collab. 




Zero power 
(prior only) 



(29) 



where rj 2 is the worst-case distortion that corresponds to the 
prior information only. Equation ( |29] i is illustrated in Figure 
[2] where a typical operational region is depicted alongwith the 
power-distortion tradeoff for distributed and connected topolo- 
gies for the cumulative-constraint problem^] The goal of any 
estimation application is to close as much of the performance 
gap (rj 2 — Do) as possible using limited resources and spatial 
collaboration is a tool that enables efficient allocation of those 
resources. We are now in a position to define as Collaboration 
Gain (CG), the following normalized (centered and scaled) 
metric, 

(30) 



cg = ^(P c )-D^(P-) 



V 2 -D 



Distributed 
Connected 



gain due to 
collaboration 




0.001 0.01 0.1 1 10 

Transmission Power, P 

Fig. 2. A typical power-distortion curve illustrating collaboration gain. 

Note that < CG < 1, which means that efficacy of 
collaboration can now be summarized for a wide range of 
problem conditions. For example, if for a problem CG = 0.01 
(say), we might conclude that collaboration is not sufficiently 
beneficial for that particular problem. On the other hand, if 
CG = 0.2 (say), we would conclude that spatial collaboration 
closes the realizable performance gap by 20% and hence, may 
be worth considering. 



3 As we shall see later, for the cumulative-constraint problem, all the 
available power must be used at optimality, i.e., P op t = P , and hence the 
subscript (C) is dropped from P c in Figure [2] 
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III. Main Results - Ideal Collaborative power 
allocation: 

In this section, we consider the situation when the entries 
of the collaboration cost matrix C are either zero or infinity, 
Cij € {0, oo}, i.e., we will solve problems |24) and ( [25) , 
where the topology A = zero(C) is assumed to be fixed and 
not subject to design. 

A. Explicit formulation w.r.t. non-zero weights 

From ( |14b| i and ( |2"T) , we note that minimizing the distor- 
tion D\\r is equivalent to maximizing the equivalent Fisher 
Information, 

, 2 



Jw — 



{g T Wh) 



Tr 



E g WE x W 



?7 2 (g T Whf + e 



where D w and Jw are related by Equation ( f2l) . Part of the 
numerator and denominator of Jw and also the expressions 
for power (([8]) for individual and ( flu) for cumulative) are 
quadratic functions of the non-zero elements in W. To see 
that explicitly, we concatenate the elements of W (column- 
wise, only those that are allowed to be non-zero), in w = 
[wi, W2, ■ ■ ■ , wl] t ■ For I = 1, 2, . . . , L, define indices mi and 
ni such that wi — W mi . ni . Further, we define Lx L matrices 
Ojn, O jd , top,m, = Z)m=i n P,m and L x N matrix G 
such that the following identities, 

w T G, 




>w 



Tr 



EgWE^W 1 -rf(g T Wh) 



Pw=Tr 



WE X W J 



M 



WEyW 1 



(32a) 



(32b) 



(32c) 



are satisfied. Precisely, the elementwise descriptions for all the 
matrices are as follows, 



9mn n = n U 

otherwise 



i G kn = { ^0 
[^JN]fe,i = gm k gm l h nk h ni ^ OjN = Ghh 1 G T ', 

[n JD ] k , = [E g ] mi mi [e x 



>k,l 



0, otherwise 



V [ fi JN] fe ,/ , and 
m, 



for k, I = 1, 2, . . . , L, n = 1, 2, . . . , N and m = 1, 2, . . . , M. 

Though Ojn is rank-1 (as described above), in general, there 
are no compact expressions for the matrices JIjd and J7p, m . 
We illustrate some relevant matrix definitions (Op m and G 
in particular) through an example, in Figure [3] with N — 4 
data-collection nodes, M = 3 communicating nodes and 3 
collaborating links, resulting in a total of L = 6 non-zero 
coefficients in the collaboration matrix W. 

For some special cases and regular topologies, more com- 
pact expressions for J~2jd may be derived. For the special case 




\5i 



A 

w->W 



"1 








0" 




w 1 














1 


1 










w 2 


w 4 





.0 


1 


1 


1. 







w 3 


w 5 


w 5 





n P = 


n P1 


+ n 


P2 + n 


P,3 




(E,j denotes [BJ V ) 
























E 22 





^23 














^22 





^23 


^24 





^32 





^33 


) 











^32 





£33 


^34 


. 





^42 





£43 


E44 





g-z 
g-i 










g-z 
g-i 










g-i 



(31) Fig. 3. Matrices for problem formulation in explicit form - an example. 



when perfect channel state information is available (S g = 0), 
it is easy to see that 

JD = G (e x - rj 2 hh T ) G T 



= GTiG T , where S = S + t] 2 ^. (34) 

This simplification will also be useful later in our discussion. 

With the help of these definitions, the objective function 
(Fisher Information) is simplified as 

A w T n JN w 

and problems |24| and |25| are re-written as 
maximize J w 

w 

subject to w T ftpw < P c , and 
maximize J w 

subject to w T ftp <m w < P^, m = 1, ... , M. 

respectively, which are both optimization problems with the 
same fractional-quadratic objective ( |3"5j ) and single (or mul- 
tiple) quadratic constraint(s). Solution of problems ([36]) and 
( |37| ) will be provided in Sections III-B and III-C respectively. 



(36) 



(37) 



(33) B. Cumulative power constraint 



Since multiplying to by a scalar a > 1 (strictly) increases 
both J w and power w T flpw (and for a < 1, strictly decreases 
them), problem po} is equivalent to its converse formulation, 
where power is minimized subject to a maximum distortion 
constraint (represented by J c ), 

minimize w T flpw 

(38) 

subject to J w > J , 

in the sense that the optimal solutions J op t(P c ) (of (i6[ ) 
and P opt (J c ) (of ( [38) )) are inverses of one another. Moreover, 
the optimal solutions hold with active constraints (satisfying 
equalities P = P° forQ and J = J° for ([38)). From ([35), 
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problem ( |38j ) is further equivalent to, 
minimize w T flpw 

(39) 

subject to w T (Jfijo - Hjn) w + J£ 2 < 0, 

which is a quadratically constrained quadratic program 
(QCQP) in L = nnz(A) variables. Note in (|39j that, though 
rip is positive definite (it is composed of blocks of another 
positive definite matrix E x , see Figure [3] for example), the 
matrix Jfijo — fijN is not, and hence problem ( |39| ) is not 
convex. However, a QCQP with exactly one constraint (as in 
problem (|39]l) still satisfies strong duality (for a background, 
see Appendix B, 11231 ) and hence the optimal solution to ([39]) 
satisfies the Karush-Kuhn-Tucker (KKT) conditions 



(f2 P + /i ( Jfi JD - f2j N )) w = 0. 



(40) 



Together with the following active constraint conditions at 
optimality 

P = w T rtpw, and w T ( JJ~2jd - Ojn) w + J£ 2 = 0, 

(41) 

which implies — -j^, the solution to problem ([39]) (equiv- 
alently, problems ([38), ( f3"6] l and ( |24] i) is summarized below. 

Theorem 1: (Power-Distortion tradeoff for Linear Coherent 
Ideal-Collaborative Estimation) For a given topology A, let 
J € (0, Ag.max (J^jN: ^jd)Q The optimal tradeoff between 
distortion (represented by J) and cumulative transmission 
power P and also the optimal weights w that achieve that 
tradeoff, are obtained through the solution of the generalized 
eigenvalue problem, 

J7p J~2jn 



Pf 



J 



In particular, the function J p\(P) 
its inverse P op t(J) are 



J~2jd ) w = 0, where = 

(0,oo) ->■ 



P 
(0,J ( 



(42) 



and 



'JD 



fopt(J) 



xpos 
A S,min 



r2 P , -n. 



ft 



JD 



JN \ 

J J 



and 



i 2 



(43a) 



(43b) 



respectively. This optimal tradeoff is achieved when the 
weights of collaboration matrix is an appropriately scaled 
version of the (generalized) eigenvector corresponding to ( |43a[ ) 
(or ( |43b[ ), since they are equivalent), say t) pt- That is, w; pt = 
cu opt, where the scalar c is such that w^ pX D,pw opi = P. 

Theorem [TJ is important since it helps to (numerically) com- 
pute the power-distortion tradeoff for arbitrary problem condi- 
tions (like topology, noise covariance, second-order statistics 
of the observation and channel gains). Since the numerical 
complexity for eigenvalue problems is roughly cubic in the 

4 Definitions of eigenvalue related operators: The operators A(P) and v(P) 
denote the solution(s) to the ordinary eigenvalue problem Pv = Ad. Operator 
Amax(-) denote the maximum among all real eigenvalues and denote 
the minimum among all positive eigenvalues (i.e., the positive eigenvalue 
that is closest to, but different from, zero). The operators Xg(P,Q) and 
vg (P, Q) denote the solution(s) to the generalized eigenvalue problem 
Pv = XQv. Operators Ag max (-,-) and ^g m j n (',0 are similarly defined 
as Amax(-) and A^(-) respectively. Note that, when Q is full-rank, then 
Xg(P,Q) = X(Q- 1 P). 



size of the problem (see for example 11241 ~). the complexity 
of computing the (cumulative) power-distortion tradeoff is 
0(L 3 ), where (recall that) L = nnz(A) is the number of 
non-zero collaboration weights. 

Corresponding to the example topology in Figure [3] and 
randomly chosen system parameters h, S and g, a typical 
power-distortion tradeoff curve is shown in Figure [2] (bold 
line). Theorem [TJ can be simplified further for several specific 
scenarios, allowing deeper insight into the power-distortion 
tradeoff as it relates to the problem parameters. The first 
obvious simplification is because of the rank-1 property of 
Ojn. The only non-zero generalized eigenvalue is (provided 
the inverse exists) 



^opt(^) = h 1 G 1 n 



with eigenvector w opt oc I fijp 



Gh, (44a) 



Gh. 



(44b) 



Equation ( |44a| l explicitly shows the effect of finite-power 
constraint P^ on the distortion. Some other insightful examples 
are discussed next. 

Example 1: For the case of perfect CSI (S g = 0), we note 



from ([34]i that fi JD 

Jopt{P) = -^e,max 



GSG . It follows that 



Ghh T G T , 



~p! 



(o) , 

— ^S.max 



hh T , S 



Pi 



Pf 



p = 



h. 



(45) 



(46) 



(47) 



assuming all the inverses exist. Step (a) reduces the size 
of the eigenvalue problem from L to N yet preserving the 
non-zero eigenvalues. Note that the corresponding generalized 
eigenvectors of problems (|45]> (say v^) and (|46]i (say v^) are 



related by vl = ^ p GYpv^. Step (b) describes the only 



non-zero generalized value of problem ( |46[ l, since hh is 
rank-1. Note that optimal collaboration weights are provided 

by ti; pt oc fip^GS (s + ^ h. 

When, in addition to perfect CSI, we also have perfect OGI 
(Sh = 0), Equation ( |47| ) further simplifies to 

-l 



•/opt (P) = h 1 



Pa 



h. 



(48) 



which was obtained in a preliminary version of this paper 
ifTTll . Equation ( |48] i can be compared to the centralized case, 
where measurements Xn tire directly observed through gains 
h and measurement noise with variance X, for which the 
Fisher Information is J cen t = h T 'E~ 1 h, which is also the 
infinite power limit of ( |48] i. One can think of the additional 
quantity i n ( |48] i, which factors in the effect of channels, 
the collaboration topology and finite transmission power, as 
equivalent to the variance of an additional noise that is added 
to the measurement noise. 

When, in addition to perfect OGI and CSI, we also have a 
distributed topology ( A — I m) and the measurement noise is 
uncorrelated (X is diagonal), we can proceed as follows. We 
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have w = d\ag(Wf\ This means that fl P = diag (diag (£ + 
rj 2 hh T )) is a diagonal matrix with m th element as a^+rfh^, 
and G = diag(g). Consequently, Tp is a diagonal matrix 

^ 2 h 2 

— — . Hence, the optimal power- 



with m th element as ^ Jm - L 

9... 

distortion tradeoff in Equation d48b further simplifies to (define 



— y 



and j m = 



) 



M 



(49) 



1 + Tn 

which was also obtained in (3). Since for the centralized case, 
we have J cen t = Ylm=i o 2 ^' Equation ( |49| indicates the exact 
fractions of individual Fisher Information that "reaches" the 
receiver. When subjected to a network-wide power constraint, 
information from the more informative (higher 7 m ) and less 
reliable (lower g m ) sensor undergoes a higher degree of 
"attenuation". While a higher observation gain h m clearly 
carries more information, it also requires quadratically higher 
power to transmit in an amplify-and-forward framework such 
as ours (note that P m = w^cr^ + rfti^)). Similarly, a 
lower magnitude of channel gain g m implies that quadratically 
higher transmission power is needed to compensate for the 
channel. Hence, in the optimal tradeoff (|49]l, it turns out 
that information from higher-/i m and lower-g m sensors are 
attenuated by a larger factor. 

Example 2: When the network is fully connected (A = 
11 T ), we proceed from ( |44a| i and obtain the following result. 

Proposition 2: The optimal solution for Example 2 is 



J 



1 



opt(P) 
where J = fa T £ 



r, 2 J 



and Wopt oc uv 7 



h, Q = g £ Q g, u = £ g, 



(50) 



v = £ h, £ = £ 



2 £ h and £ a = £ 



Pf 



Also, for the special case with perfect OGI and perfect CSI 
(when £ n = £ g = 0), the resulting distortion is information 
theoretically optimal. 

Proof: See Appendix |A| ■ 
This example is important since a connected topology 
makes use of all possible collaboration links and hence Equa- 
tion ( |50"1 ) gives the LMMSE performance limit for a network 
with cumulative transmission power constraint. From (|50j, we 
are now in a position to compute the optimal achievable Fisher 
Information Jq (and equivalently the distortion Dq), by letting 
the power go to infinity, 



Jq = J 



1 



1+rfJ 



where G = g £ Q g. (51) 



As discussed earlier, Dq (definition in ( |26] i) forms a lower 
bound for distortion in a network and is useful to characterize 

'Definition of operators diag(-) and vec(-): While operating on a matrix, 
diag : R MxN -> ]Rnnin(A/,JV) ; s used tQ extract the diagonal elements. While 
operating on a vector, diag : M. M — ► K MxM ; s used to construct a matrix 
by specifying only the diagonal elements, the other elements being zero. The 
vectorization operator vec : M MxN — > M. MN stacks up all the elements of 
a matrix column-by-column. 



the operational region and subsequently define the collabora- 
tion gain (Equation (|30l>). 

From Equation ( |50] l, it can be explicitly seen that for a fully 
connected topology, more observation or channel uncertainty 
always deteriorate the estimation performance, a notion that is 
intuitive but still was not completely evident in the discussion 
so far. The following result formalizes this notion. 

Proposition 3: If £ n 1 >- £^2 and £ g i >- £ g 2 (here A y 
B implies that A — B is a symmetric positive definite matrix), 
then for a connected topology, 



(52) 



Proof: This is established by re-writing Equation ( |50| > as 

-1 



Jo P t(P) 



1 



Q 



(53) 



Q) J 

which shows explicitly that J pt(-P) is a monotonically 
increasing function of J and Q. Recall that J = 
h T (£ + ?7 2 £ h ) _1 h and Q = g T T lg 1 g. Assuming £ h i >- 
£ n ,2 and £ g i >- £ g ,2, it is sufficient to show that a) 
J1 < J 2 and b) Q-\ < Q2, both of which are evident from 
the inequality involving positive definite matrices in Lemma 
[10] (see Appendix [B]). ■ 

The last two examples provided insight on the distributed 
and fully connected topologies respectively. The following 
example addresses a partially connected topology and shows 
how the distortion decreases with an increase in links available 
for collaboration. 

Partially connected cycle graphs: In Figure |4j we display 
a class of graphs, namely the (K — 1) connected directed 
cycle, for K = 1,2,..., AT, in which each node shares 
its observations with the next K — 1 nodes. The adjacency 
topology of such a graph will be denoted as A = C(K). Note 
that K = 1 denotes the distributed scenario while K = M 
denotes the fully connected scenario. 



© 



© 



© 



© © 

A = C(l) 

1 

10 

10 

1 

1 

N = 5,K =1 





4 = C(2) 
10 



1 1 





1 

1 1 

1 1 



1 




A = C(3) 

1110 

1 
1 
1 



N = 5,K =2 



111 
11 
10 1 
110 

N = 5,K =3 



Fig. 4. Directed cycle graphs, (K — 1) connected. 

Example 3: We assume that the collaborative topology is 
a (K — 1) connected directed cycle (A — C(K)) and the 
channel gain and uncertainties are such that the network is 
homogeneous and equicorrelated. In particular, we denote a) 
the expected observation and channel gains by h 2 and g 2 ., b) 
the observation and channel gain uncertainties by ct n and a g , 
and c) measurement noise variance and correlation by a 2 and 
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p, and thereby assume 

h = h y/ohl, Sh = hl(l 

2 



Oh) I, 

9 = ffoV^Q 1 ' S 9 = 9o(l - &g)I, and 
S = a 2 ((l-p)/ + pll T ). 



(54) 



These assumptions provide an analytically tractable example 
that is representative of a broad range of problem conditions 
and will also be used in the subsequent discussions. In addition 
to covering the partial collaboration (1 < K < N) regime, 
note that a n = 1 implies perfect OGI, a g = 1 implies perfect 
CSI and p — implies uncorrected measurement noise. 
Equation ( |44a| > can be simplified further for this example, to 
obtain the following result. 

Proposition 4: The optimal solution for Example 3 is 



h 2 r 



Jopt(P) 
1 

+ N 
where pt — p 



Pn + au\PN + 
1 

1-P 



7 + Pk + 



j] 2 h\ 



1 



and a = 1. 



(55) 

Proof: See Appendix |C[ ■ 
Equation ( |55] > helps us quantify the efficacy of collaboration 
in a partially connected network. For example, when the 
measurement noise is uncorrelated (p = 0) and we have 
perfect OGI (a n = 1> equivalently — 0) and perfect CSI 
(a g = 1, equivalently 5 g = 0), Equation ( |55] l reduces to 



J = 



K 



(56) 



For the high-(TVP) regime, (i.e., when J is large and D w 
J -1 ), we can now compare the power requirements of a 
distributed topology (say P dist , for X = 1) with that of a 
(If — 1) connected topology (say P C ( K '), provided an identical 

pdist pC{K) 

distortion performance is desired. Since = , , , this 
implies that the relative savings in power is 



7+1 



pdist _ pC(K) 
pdist 



1 - 



7 + 1 



(57) 



which depends on the local-SNR 7. For example, when the 
local-SNR is large, say 7 = 100, then even a fully connected 
network (large K) can provide only 1% power savings. On 
the other hand, if the local-SNR is small, say 7 = 1, then 
even a 1-connected network (for which K = 2) can provide 
25% efficiency in power savings. The conclusion is that one 
needs to be judicious in the design of collaborative topologies, 
especially when there are overhead costs associated with it. 
The design of collaborative topologies with finite collaboration 
cost will be discussed later in Section llVl 



C. Individual power constraints 

In the previous subsection, we have discussed the solution 
to the ideal-collaborative power allocation problem with a 
cumulative transmission power constraint. In this subsection, 
we consider the case when the individual nodes have separate 



power constraints. We recall problem ( [37 



maximize 



W T fljoW + £ 2 

subject to w T ftp m w < P^, m - 



(58) 



1. 



,M. 



Let J op t be the optimal solution of problem ( |58[ l, although 
it is not clear yet how this solution can be obtained. Unlike 
problem ([36) for which a closed from solution was derived 
in Section III-B in general, there are no known closed form 



expressions of J opt . However, as we shall show below, J opt 
can still be precisely obtained using an efficient (polynomial 
time) numerical procedure. For some special cases though, 
(somewhat) closed form and insightful expressions for J opt 
can be obtained, which will be discussed later. 

1) Numerical Solution: We would use the semi-definite 
relaxation (SDR) technique for quadratically constrained prob- 
lems IT3l . ||251 to solve problem ( |58] l. The SDR technique is 
widely used in the literature since it can reduce an otherwise 
intractable problem to one with polynomial time complexity. 
However, the main drawback of SDR technique is that, in 
general, it can guarantee only a sub-optimal solution. But, in 
some special problems, which includes our problem at hand 
(as we shall establish later), the relaxation involved in SDR 
is exact and hence, the SDR technique becomes an efficient 
numerical tool to obtain the (precisely) optimal solution. For 
more details on the SDR technique, including sub-optimality 
analysis for special classes of problems, the reader is referred 
to the book-chapter (25). 

We proceed using arguments similar to (T3), ll26l . Define 



X = WW T £ 



dLxL 



so that problem ( |58| > is equivalent to, 
Tr [n M X] 



maximize 
x 



Jx = 



,Af, 



Tr [n JD X] + e 
subject to Tr [J7 P , m X] < m = l, 
rank X = 1, X h 0, 
Problem ^60\ is further equivalent to, 
find maximum J 
such that X(J) is nonempty, 

where X(J) is defined as the following (feasible) set, 

Tr [(Jft JD -n JN ) JT] + Ji 2 < 0, 
X(J) ^{X Tr [n P ,, n X] < P%, m = 1, . . . , M, 
rank X = 1, X h 0. 



(59) 



(60) 



(61) 



(62) 



Note that by definition, any X 6 # (Jopt) will correspond 
to the optimal weights that maximize the Fisher Information. 
Also note that X(J{) D Af(J 2 ) when < Ji < J 2 . Therefore, 
assuming we can test the feasibility of X(J) for some J, a 
simple bisection search over [0, Jo] can potentially yield J,p t 
with arbitrary accurac}^] (since J op t < Jo for finite power). 
However, testing the feasibility of X( J) is a difficult problem. 

6 Arbitrary accuracy is only of theoretical interest, since the solution to the 
feasibility problem \6l\ will have numerical errors. Hence, a more realistic 
stopping criterion is a fixed number of iterations, say 15. 
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Though the set of symmetric positive-semidefinite matrices 
is convex and the other M + 1 inequalities in |62) are also 
convex, X (J) is still not convex due to the rank constraint 
rank X = 1. Relaxing this constraint, we define, 

r Tr [(m JD -n M )x} + je < o, 

X R {J) 4 I X Tr [th>, m X] < Pg, m = 1, . . . ,M, 

(63) 

which is now a convex set (superscript R stands for relaxation). 
Because of this relaxation, we have X(J) c X R (J) for all J. 
We denote the solution to the new problem, 



find maximum J 

such that X R (J) is nonempty, 



(64) 



as J R pV so that J opt < J R pt . Hence, in general, the solution of 
the relaxed problem ( |64| i only provides an upper bound to the 
solution of the original problem ( |6Tj ). However, the following 
result establishes the fact that the relaxation is tight, i.e., J op t = 
J R pt for our specific problem. 

Proposition 5: (Semidefinite relaxation is tight): Assume 
Op be positive definite. Then, for any feasible J < J op t, 
X R (J) contains a rank-1 matrix. 

Proof: See Appendix [D] ■ 

The matrix Op is usually positive definite since it is 
composed of blocks of the matrix E x = E[xx T ]. Since 
X R (J) contains a rank-1 matrix, X R (J) n X(J) is non- 
empty, and feasibility of X R (J) also implies the feasibility 
of X(J), thereby making the relaxation tight (in terms J^L 



being equal to J op t). Note that the solution to ( |64| i can also be 
obtained using a bisection search. However, feasibility test of 
X R (J) is a convex semi-definite programming problem and 
hence can be performed efficiently (in polynomial-time). The 
computational complexity^] of such a feasibility problem is 
roughly 0(M 2 L 2 ), where M is the number of sensors and L 
is the number of non-zero collaboration weights. We have used 
the publicly available software SeDuMi as the optimization 
tool [28 1 for our numerical simulations. 

2) Closed Form solutions: Though numerical solution of 
the general problem ( |58) l can be obtained using the procedure 
outlined in Section |III-C1| (somewhat) closed form solutions 
can be obtained for some special cases. All the special cases 
discussed in this subsection will make use of the following 
optimization problem in its core, 



maximize Fi 
t 



(Em=l a 



— i Uj m L rn 



Z^m=l u m°m ' S 

subject to < t m < Cm, m = 1, . . . , M, 
given that a m and b m are positive for all m, 

which is known to have the following solution. 



(65) 



'Generally, numerical techniques for semidefinite programming are iterative 
in nature, see 1271 for a detailed discussion. In the dual formulation of the 
problem, each iteration solves a linear problem in M variables and -k( L ^ +1 ) 
equations, with the resulting complexity being 0(M 2 L 2 ). The number of 
such iterations is generally between 5 to 20 for many practical purposes 
(although theoretically, it is also a polynomial function). 



Proposition 6: (Solution of problem §65\ , see T\M ): Order 
the sensors based on the parameter d m = fc a " such that, 
without loss of generality, 



di > d 2 > ■ ■ ■ > dM- 



Define 



E 



k b c 2 +£ 2 

m= l u rn<^ m ' S 



Eft 
m=l a rt 



(66) 



(67) 



Also, define rh algorithmic ally as follows - keep checking 
in the decreasing order rh — {M, M — 1,...,2} whether 
^rfi-if? > c fh, and stop at the first instance this condition 
is satisfied. If ^i? 2 < ct, then m = 1. Then the solution to 

02 

problem ( |65| ) is F opt which is achieved when t = t op t, where 

2^m=l a m,C m J ^2 



F r 



opt 



l opt,m 



E 



b c 2 4- £ 2 



m= l "m^ m 



b. n 



< c m , in — rh 



M 

E 

m— m+l 

= l,...,m 

1,...,M. 



(68) 



Note that all the constraints are active (i.e., m = M or 



opt,m 



for all to) if and only if <&M-id.M > 1. 



Notation: Corresponding to constants a, b and c, we 
would denote the optimal solution ( |68] l of problem ( |65j ) 
as F p\{a,b,c). When mentioned in conjunction with 
F pi(a, b, c), the corresponding value of t will be simply 
denoted as t opt , i.e., without the arguments, to avoid repetition. 

Next, we provide some insightful examples. Some of these 
results will be used to derive the collaboration gain for 



homogeneous networks in Section III-D 



Example 4: For the problem with a) distributed topology, 
b) perfect information about observation and channel gains, 
and c) uncorrelated measurement noise, problem ( |58[ ) can be 
simplified as (note that w = diag(W^)), 



maximize 



subject to w m a x 



w T d\ag(b)w + £ 2 

< P G m = l 

,m — m' ' • 



where 



— grnh m , b m — SmC^, (T m — [S] 
2 i 2 1.2 

= a m +r] h m . 



(69) 



(70) 



Note that since a m and b m are positive, the value of w m at 
optimality has to be positive, hence the quadratic constraints 
in ( |69l > reduce to the linear constraints as in (65). The rest 

of the problem is solved by defining c m = 
applying Proposition [6] We obtain 

^v-iffi 7 ^ 2 




and 



opt 



E 1 



n-2 /j2 ^2 



M 

E 

m— m+l 



Ml 

<t1 



(71) 



Equation ( fTTj i is mathematically equivalent to the solution 
in lfl4ll . which was obtained in the context of maximum-SNR 
beamforming. This is because, for perfect OGI and perfect 
CSI, maximum-SNR also implies minimum-MSE. However, 
in the presence of observation and channel gain uncertainties, 
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the two problems are different. 



When all the constraints are active, the solution is 



The optimal Fisher Information in ( f7T| can also be com- 
pared to that in the cumulative-constraint case. With individual 
constraints, some sensors (those with lower "reliability-tO- 
Ti™ ^/<x^+») 2 'i£ 



power ratio 



'-, precisely sensors to 



to + 1,...,M) effectively "convey" the entirety of their 
individual Fisher Information to the FC while the other sensors 
(to = 1 , . . . , to) can convey only a fraction of their combined 
sum. In contrast, for the cumulative-constraint problem, dif- 
ferent fractions of individual Fisher Information reach the FC. 
However, in the infinite-power limit, both the cases converge 
to the centralized Fisher Information J cen t 



The previous example assumed perfect CSI and perfect 
OGI. The following extension of Example 3 (homogeneous 
network with equicorrelated parameters) illustrates the dete- 
rioration of performance with observation and channel gain 
uncertainties. 

Example 5: Consider a distributed topology with problem 
conditions similar to Example 3 (see (|54|i). To find the optimal 
distortion for this example, we proceed directly from ( (3T| , 
using the full-matrix notation 

(g T Wh) 2 



maximize J u 

w 

W=diag(ii>) 
subject to 



Tr 



EgWEyW 1 



rj 2 {g T Why + e 



WE X W T 



<P°m=l,...,M. 



(72) 



Noting the following identities, 

(g T Whf = a g a h glh 2 a (w T l) 2 , 
E g WE x W T ^ = go4{(l - a g a x )w T w 

+a g a x (w r l) 2 }, 



Tr 



WE X W L 



a 2 w 2 , where 



2 2 , 2,2 per 2 + a h r] 2 hl 
a x = a + Tj h , a x = 3 , 



problem f72\ is equivalent to, 



maximize F„ 



"g^hgo^o (w T l) 
9 2fJ x (1 - a g a x )w T w + £ 2 ' 



(74) 



subject to cr\w 2 m < P!^, to = 1, 



in the sense that F w is monotonically related to J w through 
J w = y — l~ ■ Problem |74} is similar in form to 

problem §65\ and the solution can be obtained by applying 
Proposition [6] We have 

2 1-1 

pa 1 



Jopt (P) 



1 



F opt (a, b, c) a h hl 



where 



(75) 



y/a g a h g h Q , b m = crlgl{l - a g a x ), 




Jopt (P) 



dot 1 ~ a g a x )J2Pm +£ 2 , pa 2 



a g a h g 2 h 2 a x 2 (j:VP^) a ^o 



(76) 



from which it is evident that the Fisher Information decreases 
with more uncertainty in observation and channel gains (lower 
values of au and a g ). This result will be useful later to derive 
the collaboration gain for this example. 

The following example considers a fully connected network. 

Example 6: For the problem with a) fully connected topol- 
ogy (A = 11 T ) and b) uncorrelated channel gain uncertainty 



(Hg is diagonal), we can start from fTZf , where the variable 
of optimization is the entire matrix W, 

> 2 



maximize Jw = 
w 



{g T Wh) 



Tr 



E g WE x W 7 



i] 2 (g T Whf 



subject to 



WE X W' 



< P, n , m = l,. 



,M. 



(77) 

Problem |77) can be simplified further based on the assump- 
tion of diagonal S g . However, the analysis is relegated to 
Appendix [E] and we state just the result here. 

Proposition 7: The optimal solution for Example 6 is 

if J 



J< 



opt 



p = J 



1 



1 



F op t (a, b, c) 



where 



(78) 



diag (S g 



and Cn 



and J = h 1 S h, S = S + r] 2 "£ h , as defined in ([50J. 
Optimality is achieved when the corresponding weights are 

1 (79) 



W opt = nt opt v , v = S h, k = 



J "(1 + V 2 J) 



Proof: See Appendix [E] ■ 
From the formula for optimal weights W pt in ([79]), we 
note that all the sensors have identical fusion rules (precisely, 
the vector v T ) but they transmit using different transmission 
power (according to Kt op {). It may be surprising to note that 
even though all the sensors are transmitting the same informa- 
tion coherently, they may still refrain from using the maximum 
power available (£ pt,m < c m , in general). This is because 
of the uncertainty in the channel (which is captured by the 
diagonal entries of S g ). If a channel is too uncertain, allocating 
a large amount of power to the sensor (even if the power is 
locally available) may not be helpful for inference. Indeed, 
all other parameters being constant, a higher magnitude of 
[Eg] m m results in a lower value of d m (see problem ( |65| l) 
and hence the power constraint is more likely to be inactive at 
optimality. However, for perfect CSI (S g = 0), all the sensors 
must transmit with maximum power available. 

We are now in a position to explicitly derive the formula 
of collaboration gain for homogeneous networks. It must be 
mentioned here that explicit formulas of CG for arbitrary prob- 
lems are difficult to derive and we only provide the example of 
homogeneous networks in this paper. As mentioned in Section 



II-G collaboration gain is a useful metric that indicates the 
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efficacy of spatial collaboration for estimation. 



D. Collaboration Gain for a Homogeneous Network 

This analysis of homogeneous networks is particularly il- 
lustrative since it is actually possible to derive closed-form 
expressions of CG for a wide range of individual (and also 
cumulative) power constraints. From the closed form Equation 
of CG, we can actually predict the conditions for which 
collaboration will be particularly effective. Specifically, we 
will derive the formula for collaboration gain and analyze how 
it depends on the problem parameters like size of the network, 
noise correlation, observation/channel gain uncertainties and 
power constraints. 

1) Theoretical analysis for a homogeneous network: Let P 
be the total power used in the network. For the cumulative- 
constraint problem, P is a sufficient descriptor of the power 
constraints. However, for the individual-constraint problem, 
we have M different power constraints {Pi, P2, . . . , Pm}, 
which makes it difficult to analyze the problem theoretically. 
Fortunately, for a large subset of those problems, only 2 
summary-descriptors of {Pi, P2, . . . , Pm} suffice to charac- 
terize the collaboration gain - namely, the cumulative power 
P, and a skewness parameter n, defined as, 



P 



sq 

P : 



P 



sq 




M 



P 



\m=l / m— 1 

As a example, we refer to Equation ( |76| ), where the distortion 
is seen to depend on only Pm (which is P) and (J2 y/Pm) 
(which is kP). It is easy to see (applying Cauchy-Schwartz 
inequality) that 1 < n < M, where the limiting values have 
distinct significance. The value k = 1 implies that only one 
of the sensors has all the power (Pi = P, say) while other 
sensors have no power at all P2 = • • • = Pm = 0. On the 
other hand, the value k = M implies that all sensors have 
equal power allocated to them Pi = • • • = Pm = 



km ( a normalize^ version of k) as 



M 



K M 



M 



T e [0, 1] 



Define 



(81) 



A stronger rationale behind naming the parameter k as skew- 
ness is provided by the following result, which shows that 
k is monotonically related to another quantity that explicitly 
enables power allocation in a skewed manner. 

Lemma 8 (Parameter k is an indicator of skewness): 
Assume that power available at the M nodes are 

Pi = Po, P2 =QPo,..., Pm = qM-^Po, q e [0, 1], (82) 

so that q is an explicit measure of dissimilarity (or skewness) 
among the power available at various nodes. Then (recall 
definition of k in (|80"1>), 

1 + y/Q + ■ ■ ■ + (y/o) 



k(q) = 



1 



+ Q 



M-l 



(83) 



The parameter km is similar in structure to the Chiu-Jain fairness metric 
1 29 1 used for congestion control in computer networks. 



is a strictly monotonically increasing function of g. Note that 
k(q = 0) = 1 and k(q = 1) = M. 

Proof: See Appendix [F] ■ 
The following proposition provides an explicit formula 
for the collaboration gain in a homogeneous network with 
equicorrelated parameters for both the individual-constraint 
and cumulative-constraint problems. We would use Examples 
3, 5 and 6 to establish this result, the derivation of which is 
relegated to Appendix |G| 

Proposition 9: (Collaboration gain for a homogeneous net- 
work): For the individual-constraint problem, let the node 
indices be arranged (without loss of generality) in such a way 
so that Pp < • • • < Pjf.j- Let P and k denote the summary- 



descriptors of {Pp, . . • , Pm} as P er Equation ( |80"1 >. Assume, 



pC 



(84) 



Then, all the power constraints are active at optimality, P m = 
P^, Vm, for both distributed and connected topologies, and 
the collaboration gain is 



CG = 



(I _ ±\ ( « l+(M-l)qg 

K 1 Ml \M l + (K-l)Qg 



1 



K-l 

M — 



1 a a ) (1 



1 



(80) where P g 



Pg(l+(K-l)ag) 

1 + 7 ' 



a x = 



1 4. (gzj>8 n 



(85) 



For the cumulative-constraint problem, the collaboration gain 
is given by Equation ( (85| ) by setting k = M, 

Proof: See Appendix [G] ■ 
Several remarks about Proposition |9]) are in order. Condi- 
tion ( |84| i is basically an assumption that the power constraints 
are not too skewed. In Equation ( |85] > note that all the quantities 
in parenthesis in the numerator denote quantities less than 1, 
while those in the denominator are greater than one, so this 
reaffirms the notion that collaboration gain is always less than 
1. Furthermore, 

a) Dependence on local-SNR 7, noise correlation p, and 
observation gain uncertainty a^: Collaboration gain increases 
with a decrease in a x , which means that CG increases as 
a) Noise correlation decreases (smaller p), b) observation 
gain uncertainty increases (smaller a n ), ar, d c ) local SNR 
decreases (smaller 7), provided p < (which is typically 
true for a problem involving moderately correlated noise and 
sufficiently certain observation gain). Hence, collaboration is 
more effective when the local-SNR is small, measurement 
noise is uncorrelated and observation gains are uncertain. 

b) Dependence on (normalized) total power Pg, power 
skewness K and channel gain uncertainty a g : To understand 
the effect of Pg, k and M on collaboration gain, we simplify 
Equation ( |85| l by considering the large- M asymptotic regime. 
Note that in general, an infinite number of sensors implies that 
the Fisher Information is infinite and distortion is zero for both 
distributed and connected topologies, which is a trivial regime 
to consider. Towards the goal of analyzing regimes that incur 
only finite distortion in the asymptotic limit, we consider two 
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cases, as listed below. 

First, we would consider the fixed- K-large-P regime, which 
signifies that the effective number of powered nodes are not 
increasing with M (recall that k = 1 implies that only one 
node has all the power, regardless of M), i.e., most of the 
nodes are auxilliary nodes that provide their information to 
the powered nodes which then communicate with the fusion 
center. Hence, even with a large transmission power the 
distortion at the FC is finite, which makes this regime non- 
trivial. From ([85|, we note that 



lim CG 



(1 - a x )a g K 
a g (n - 1)) (1 + a x a Q (n - 1)) 



(86) 



With the additional technical assumption that a Q > 1 * 
(which basically means that the channel gains are sufficiently 
certain), the collaboration gain (Equation ([86])) decreases as k 
increases, and hence is maximum when k — 1, at which point 



CG = (1 - a x )a Q 



(87) 



That CG decreases with a x has already been discussed in 
the previous remark, and also applies for this regime. We 
conclude that in the fixed-K-large-P regime, collaboration is 
highly effective if there are only a few powered sensors (small 
k), and the channel gains (higher a g ) are fairly certain. 

Next, we consider the fixed- (k /M) -finite- (P x M) regime. 
Here, the normalized skewness parameter Km is kept constant, 
which means that effectively, the number of powered sensors 
k increase linearly with M (note the contrast with previous 
regime). To keep the effective Fisher Information (and result- 
ing distortion) finite, the total available power P must scale 
inversely proportionaj^] to M. Let Pm — Pg^I denote the 
normalized total power, which is a finite constant. From ( [85] l, 
we can derive that 



max lim CG 

P M M-hx 
P 9 M=P m 



1 - V^x 
1 + v /a x ' 



when P, 



1 



M 



(88) 



This implies that collaboration is more effective for smaller a x , 
a fact that was established earlier as well. We conclude that in 
the fixed-(ft/M)-finite-(P x M) regime, collaboration is most 
effective when the normalized operating power is a particular 



l 



Moreover, P 



A I 



finite quantity, precisely Pjj^ = 

increases as the power constraints get more skewed (smaller 
Km) or the channel gains get more uncertain (smaller a g ). 

The assertions in Proposition [9] and the subsequent dis- 
cussion are illustrated in Figure p] for an example with the 
following problem parameters rf= |, a 2 = 1, p = 0.1, 
go = ho = 1 and a g = a n = 0.9. Contours of the actual 
collaboration gain are computed numerically using procedure 



outlined in Section III-C and displayed for a wide range of 
cumulative power P g , number of nodes M and skewness of 

'Another asymptotic domain that is conceptually different but otherwise 
would yield identical results is to keep the power P constant and let the 
channel gains g scale at the rate of . The argument here is that, though the 
network size increases with M, the channel capacity of the effective multiple- 
input-single output (MISO) channel induced by the M sensors, precisely 



power constraints. To illustrate the fixed-K-large-P regime, 
we have used n = 1 (only one sensor has all the power) 
in Figure ga). To illustrate the fixed- ( K /Af) -finite- (P x M) 
regime, we have used km = {1/3,2/3,1} in Figures |5^b), 
|5jc), and [5jd) respectively. To simulate a particular skewness 
K, the local power constraints are generated in accordance with 
( |82| ) by finding the corresponding value g through bisection 
search (recall that g and k are monotonically related, as per 
Lemma IS). The active-constraint condition of Equation ( |84| i is 
depicted through the dotted line in Figures |5jb) and|5|c). The 
portion of the figure to the right of the dotted line suggests 
that one or more of the constraints are inactive, while the left 
half denotes the region where all the constraints are active 
and consequently Equation ( [85] l is the accurate measure of 
collaboration gain. No dotted lines appear in Figure [5ja) and 
|5Jd) because all the constraints are trivially active in both the 
cases, although for different reasons. For k = 1, there is only 
one sensor with all the power and hence it must transmit with 
full power, which explains Figure [5J a )- For k = M, we have 
equal power allocation (P^ = P 

reduces to P c < (VpC + — — 

trivially satisfied for all P c . 

For the problem parameters mentioned above, we calculate 
that a x s» 0.37. For the fixed-K-large-P regime, theoretical 
justifications predict that the maximum gain possible (across 
various problem conditions) is (1 — a x )a g ps 0.57, which 
can be confirmed from the contours toward the top-right 
corner of Figure |5ja). For the fixed- (k/M) -finite- (P x M) 
regime, theoretical predictions yield that the maximum gain 



say) and condition ( |84| ) 

f 2 \ 2 

5 1 which is 



is 



l--v/5? 



, is held constant. 



0.24, which can be validated from the innermost 
contours of Figures |5jb), [5jc) and[5|d). Note that the contours 
shifts to the left as km increases. This is due to the fact that the 
normalized power required to achieve maximum collaboration 
gain decreases as km become larger, a fact also discussed 
above. In conclusion, for this particular problem instance, upto 
57% (and 24%) of the distortion performance can be recovered 
using collaboration in the fixed-K-large-P regime (and fixed- 
(/•c/M)-finite-(P x M)) regimes. We have established this fact 
using both numerical results and theoretical insights. 

2) Random geometric graphs: To demonstrate how the 
distortion decreases with increasing collaboration, we consider 
the following simulation setup. The spatial placement and 
neighborhood structure of the sensor network is modeled as a 
Random Geometric Graph, RGG(7V, r) l30l . where sensors 
are uniformly distributed over a unit square and bidirec- 
tional communication links are possible only for pairwise 
distances at most r, i.e., the adjacency matrix is A such 
that Aij = l^j < r i. Correspondingly, the cost matrix is a 
{0,oo} matrix with the (i,j) th element being zero only if 
di,j < T, otherwise being infinity. We assume N = 20 sensor 
nodes and gradually increase the radius of collaboration from 
r = (signifying distributed topology) to r — v2 (signifying 
connected topology, since the sensors are placed in a unit 
square). The simulated sensor network is depicted in Figure [6] 
with collaboration radius r = 0.2. In general, for < r < y2, 
the network is only partially connected as in Figure [6] 

We simulate a homogeneous network with the following 
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(a) Only one sensor has power [km = 0) 




0.01 1 100 



Cumulative Power, Pg 
(c) Mildly skewed power constraints [km = 2/3) 




0.01 1 100 



Cumulative Power, Pg 



(b) Highly skewed power constraints (km = 1/3) 




0.01 1 100 



Cumulative Power, Pg 



(d) All sensors have equal power (km = 1) 




Fig. 5. Collaboration gain in a homogeneous sensor network with skewed power constraints. 
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0.1 





0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Fig. 6. Random Geometric Graph with 20 nodes, used for example in Section 
|III-D2| Ed ges are shown for pairwise distance less than 0.2. The radius of 
collaboration is depicted for sensors 1,2,3 and 17. 



parameters a 2 = 1, p = (independent noise), g = h = 1 
and a g = a n = 0.9. To contrast the effect of prior uncertainty 
on collaboration gain, we simulate two different variance of 



the prior, f] 2 =0.1 and 0.5. To illustrate the effect of power 
constraints, we simulate a wide range of both cumulative- 
power and skewness. We simulate three skewness conditions 
- km — 0.5,0.75 and 1, the value of 1 implying equal power 
allocation. The three values of cumulative-power that were 
simulated were a) Pg = P* = 



0.75x20 a g Jai 



(recall from the 

discussion in Section |HI-D 1 1 that, for k = 0.75, this is a high- 
CG operating region), b) Pg = P* /A (depicting the low power 
regime), and c) Pg — 4P* (depicting the high power regime). 

The simulation results are depicted in Figures |7]-(a) and (b) 
for the two values of prior uncertainty rj 2 = 0.1 and rj 2 = 0.5 
respectively. Corresponding to these values of rj 2 , the infinite- 
power distortion Do, the maximum possible collaboration gain 
CG* and the corresponding operating power P* for km = 
0.75, are calculated by using d55j and 



2 I 



V 
0.1 
0.5 



D CG* P* (k m = 0.75) 
0.07 0.29 0.13 
0.04 0.56 0.25. 



(89) 



With varying power availability and varying extent of 
collaboration, the resulting distortion varies between D € 
(Do,rj 2 ), where the ranges are (0.07,0.1) and (0.04,0.5), 
for the prior variance ?/ 2 = 0.1 and ?/ 2 = 0.5 respectively. 
To compare the effect of collaboration across the two differ- 
ent problem conditions, we depict the normalized distortion 
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• e-Pg 
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-P g = P*/4,k m =0.75 
P g = P*/4, KM = l 
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-P g = P«, 
P E = P*, 



km = 0.50 
k m =0.75 
k m = 1 



• ♦ - Pg =4P*, K M =0.50 



-P g =4P* 
Pe = 4P* 



km = 0.75 

km = 1 



Connected 



♦ 4- -4-4- -4- 



0.3 0.4 0.5 0.6 0.7 
Radius of collaboration, r 



Fig. 7. 



(a) rp = 0.1 (b) T] 2 

Improved in distortion due to increasing collaboration among 20 nodes in a random geometric graph. 



: 0.5 



,p m Figures pl-(a) and (b). Note that in this normalized 
scale, the collaboration gain is simply the difference between 
the right-most (distributed) and the left-most (connected) ends 
of a curve. The efficacy of collaboration, as indicated by 
the downward slope of the curves, is clearly demonstrated in 
Figures and (b), where it is noted that a large part of the 
overall gain is achieved only with partial collaboration, i.e., 
the distortion tends to saturate beyond a collaboration radius 
of r w 0.4. Hence, though the collaboration gain, as defined in 
([30j, requires a fully connected topology, a large part of that 
gain can be realized with only a partially connected network. 
Also, we note that the efficacy of collaboration diminishes 
when the operating power is too small (P g = P*/4) or too 
large (P g = 4P*), indicating the fact that collaboration in a 
network should be used judiciously, specially when there are 
costs associated with it. Finally, we note that the efficacy of 
collaboration is higher when the prior has a lower variance 
(the curves in Figure |7]-(a) has more downward slope than 
those in Figure |7]-(b)). This observation is explained directly 
by the comments following Proposition [9] where we argued 
that collaboration gain increases with decreasing local-SNR. 

IV. Collaboration with finite costs 



In Section III we have solved the optimal collaboration 
problem for the situation when the cost of communication 
for each link is either zero or infinity, i.e., Cjj € {0,oo} 
(also termed as the ideal case). In this section, we address 
the general problem where communication may incur a non- 
zero but finite cost, i.e., < Cij < oo (also termed as the 
finite-cost case). Unlike the ideal case, finding the globally 
optimal solution for the finite-cost case is a difficult problem 
and there are no known numerical techniques that efficiently 
solve this problem. In this section, we outline an efficient 
(polynomial-time) numerical procedure that obtains a locally 



optimal solution to the finite-cost problem. We first describe 
our iterative solution for the individual-constraint problem. 
The cumulative-constraint problem follows from similar ar- 
guments and is described next. Lastly, solutions to both the 
problems are demonstrated using numerical simulations. 

A. Individual power constraint 

We propose an iterative solution as follows. Let the collab- 
oration topology and transmission power availability (for all 
N nodes, in vector form) at iteration i be denoted by Aj and 
p l ' a " s respectively. Note that the transmission power avail- 
ability is the difference between the original power constraint 
and the collaboration cost due to the topology A,. For the n tb 
node, it means 



M 



transl 



(90) 



Note that for the auxiliary nodes n = M+l, . . . , N, the trans- 
mission power availability [P'/ ans ] n does not mean much, 
since they cannot transmit to the FC anyway. For those sensors, 
it is best if they use their entire resources for collaboration. Re- 
call that the optimal distortion corresponding to any topology 
A and transmission power constraint p trans can be obtained 
from the discussion in Section [ill] Denote such a distortion 
by D^ pX (p trans ) . We start with a distributed topology, i.e., 
Ax = [Jjvr|0] and follow a greedy algorithm. At iteration 
i, we evaluate the distortion performance corresponding to 
all incremental topologies of the form Aj + E(m,n), where 
E(m, n) is an all-zero matrix except for the (m, n) th element, 
which is 1 (signifying an incremental n — > m link). There 
are MN — nnz(A.j) such possibilities for selecting E(m, n), 
each corresponding to a link that is current not being used 
(equivalently [Aj] is zero). The number of such possibilities 
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may even be lesser if there is not enough power to make a 
link possible. For example, if C mn > [P l ' ans ] , then the 
node does not have sufficient power to use the n — > m link 
for collaboration. Among all such possible links, let n* — > m* 
denote the link that provides the best distortion performance. 
Then the iteration is concluded by augmenting the topology 
with the edge n* —> m*. Thus, in compact notations, each 
iteration is represented as, 

^Ai+E(m,n) ( D trans 



(m*, n*) = arg 



mm 

C m „<[P trans l 



D, 



opt 



(91) 



where e„ is an all-zero vector with the exception of 71 th 
element, which is 1. The iterations are terminated when one 
of the following conditions is satisfied, a) there is no feasible 
link, b) maximum number of iterations has exceeded a pre- 
specified limit or c) there is not enough increment in (relative) 
performance after a particular iteration, say, for a pre-specified 
S, if 



n A i+ i / ptrans \ 



D Ai (P 
^opt \^ 



trans \ 



< 6. 



(92) 



A rough estimate of computational complexity can be estab 



lished as follows. Recall from Section III-C that computing 
D^ t (P) is roughly O (M 2 i 2 ), where L = nnz(A). Assume 
that the algorithm gets terminated after adding O(N) links, 
which is to say that each node communicates with a fraction of 
its neighbors at optimality. Since we start with the distributed 
topology we have L = O(N) for all the iterations. Since 
each of the iterations (as in ( |9"Tj )) require approximately MN 
function evaluations, the total complexity of the finite-cost 
collaborative power allocation problem is 



0(M 2 N 2 ) x 0(MN) x O(N) = 0(M 3 N 4 ). 



of -D 0I 



evaluations 
per iteration 



of iterations 



It must be emphasized here that ( |93) is only a rough (but 
practical) measure of complexity and is not a rigorous bound. 
It assumes a fixed number of iterations to solve a semidefinite 
optimization problem (see discussion in Section |TH-C[ ) and also 
sufficiently high collaboration costs so that only O(N) links 
are added starting from a distributed topology. 



B. Cumulative power constraint 

To solve the cumulative power-constraint problem, we pro- 
ceed on similar lines. Let the cumulative power constraint 
be P c and the cumulative transmission power availability at 
iteration i be denoted by i^ rans , so that 



M N 



P 



trans 



(94) 



m— 1 n— 1 



Denote the optimal distortion corresponding to any to pology 
A and transmission power constraint P trans (see Section III-B 1 
as D^ x (P trans ). Starting from a distributed topology, the best 



collaboration link n* 
according to 



/ * * 

(m , n 



mm 

m,n 



arg 

[A. 

C mn <P] 
A-i -\- Pjm*n* 1 



is selected at each iteration 



n Ai+E(m,n) / p trans r* 
^opt V% ~ °« 





.trans 



(95) 



with the stopping criteria being similar to that described in the 
previous subsection. As regards to computational complexity, 
recall from ( |HI-B| i that the complexity of computing D^ { (P) 
is O (L 3 ), where L = nnz(A). Assuming O(N) iterations 
(as in the previous case) the overall complexity for the finite- 
cost-collaborative cumulative-constraint problem is roughly 

0(N 3 ) x O(MN) x 0(N) = 0(MN 5 ). (96) 



evaluation 
of Dam 



evaluations 
per iteration 



number 
of iterations 



C. Numerical Simulations 

To demonstrate the efficacy of collaboration in finite-cost 
scenarios, we consider a random geometric graph of M = 
N = 10 nodes. As in the previous examples, we consider a ho- 
mogeneous network with the following parameters rj 2 = 0.5, 
a 2 = 1, p = (independent noise), go — ho — 1 and a g = 
ah = 0.9. The collaboration cost of link m — >• n is assumed 
to increase quadratically with the distance between nodes m 
and n. This is because the gain of a wireless channel is 
often inversely proportional (upto a constant exponent) to the 
distance between a source and a receiver [31 1. Consequently, 
to maintain a reliable communication link, the transmission 
power has to be scaled up accordingly. In particular, we 
assume 



c d 



2 

m,n ' 



(97) 



(93) where e? m .„ denotes the distance between nodes m and n 
and Co is a constant of proportionality. For our numerical 
simulations, we consider a wide range of cq, specifically 
Co € [10 -4 , 10 4 ], to depict the effect of collaboration cost on 
the distortion performance. A lower collaboration cost in effect 
allows the network to collaborate more and thereby reduces 
the distortion. We consider two magnitudes of total operating 
power (namely, P g = 1 and 3) for both the individual- 
constraint and cumulative-constraint cases. For the individual- 
constraint case, we consider two skewness conditions for the 
power-constraint, namely km — 0.5 and — 0.75. The 
corresponding distortion curves are shown in Figure [8] For 
very low values of the cq, the distortion converges to that in 
a fully connected network. Similarly, for very high values of 
the Co, no links are selected for collaboration, and the network 
operates in a distributed manner. Since in our example, the 
network is homogeneous, equal power allocation among nodes 
is also the optimum power allocation for a cumulative-power- 
constrained problem. Consequently, the performance of the 
cumulative-constraint problem (dash-dotted lines) is always 
better than that with individual power constraints (bold and 
dashed lines). In conclusion, Figure [8] shows that in a homo- 
geneous network, the estimation performance improves with 
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higher operating power, less skewed power constraints and 
lower collaboration cost among sensors, as expected. 



0.35 

q 0.3 h 
I 

Q 

~— ' 0.25 



i 0.15,,-.. «. 



-e— P g = I, n M = 0.50 
. e -Pg = I, k m = 0.75 
-o- Pg = 1, Cumulative 
-^Pg = 3,re M = 0.50 
. ■ - P g = 3, re M = 0.75 
-a- Pg = 3, Cumulative 




0.05 
10 



10" 10' 
Collaboration Cost, c.q 



Fig. 8. Estimation with finite collaboration cost - an example with 10 sensor 
nodes. 



V. Conclusion 

In this paper, we addressed the problem of collaborative 
estimation in a sensor network where sensors communicate 
with the FC using a coherent MAC channel. For the scenario 
when the collaborative topology is fixed and collaboration 
is cost-free, we obtained the optimal (cumulative) power- 
distortion tradeoff in closed-form by solving a QCQP prob- 
lem. With individual power constraints, we have shown that 
the semidefinite relaxation technique can be used to obtain 
precisely optimal numerical results. Several special cases are 
presented as examples to highlight the problem conditions for 
which collaboration is particularly effective. Through the use 
of both theoretical and numerical results, we established that 
collaboration helps to substantially reduce the distortion of 
the estimated parameter at the fusion center, specially in low 
local-SNR scenario. As future work, we wish to explore the 
collaborative estimation problem when the parameter to be 
estimated is a vector with correlated elements. The scenario 
when collaboration is erroneous, as mentioned earlier, is also 
important. 



Appendix A 
Proof of Proposition|2] 

Since w = vec(W) for a connected topology, we obtain 
tip = E x <g) I, G — I ®g and 



OjD — 



E x < 



E g ) - ifGhh T G J 
E„) + GEG T . 



(98) 



Substituting the appropriate values in ( |44a| i, we have (assum- 
ing all the inverses exist) 

-l 



J opt (P)SU T G T (GEG J 



Sir 



Gh 



E + r P 
i 



h 



h 



(d) 



J 



1 



1 + if J 



(99) 



where step (a) follows by defining Op = E x g3 S g where S g 
is already defined in < [50| >. Step (b) follows from arguments 
similar to those used in ( |46] > and by defining (and subsequently 
simplifying) T> as 

f P = (g t o p x g) 

-l 



where Q = g T E g 1 g, 



(100) 



step (c) follows from the fact that E x = H + rfhti 7 \ and step 
(d) follows from the definition J = h T H h and following 
identities involving rank-1 updated matrix inverses. For any 
scalars a^0 and /?, vector p and invertible matrix Q, 

tn-1 Q 1 PQ- 1 pp T Q 1 



(aQ + Ppp 1 



P T (aQ + f3pp T ) 'p = 



a[a + pQp) 

= p T Q X P (102) 



From equations ( |44b| i and discussion in Example 1, the 
optimal weights are, 

«? op t oc Op X GE Te + Tp\ 1 h, (from (g^J) 



oc J~2 P Gh 



E^ 1 (g, Eg 



(h®g) 



= (E^h) ® ( E g g 



(103) 



which implies that W opt oc E g gh T E x 1 . Step (a) follows 
the fact that (e + f p) hoc E^\ (see ( fTOl) ). 



From Corollary 2.3.5 of 021 . the sum-rate required to 
encode a single-dimensional real-valued Gaussian source with 
variance rj 2 , observed through the vector h and Gaussian 
observation noise with covariance E, in such a way that 
reconstruction incurs an average distortion of at most D, 
satisfies 

i2w > \ log -^j^ , where A = f J ° . (104) 
2 D - D I + ^Jq 

Since, for a fixed sum-power P, the sum-rate has to be lesser 
that the (centralized) capacity of the coherent MAC channel, 
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i.e., R lot < C, where C = \ log(l + |Jg|| 2 P 4 ), we obtain 

,2n v V 4 Jo 



(105) 



(D-D )(l + ri 2 J )- 
Replacing D by J (recall, J = — \) and after some algebra, 



we obtain, 



J< Jo 



1 + 



1+jfJo 
\\9\\ 2 P 5 



(106) 



But the right hand side is precisely the distortion achieved by 
a connected network (see |50} and note that Q = ||g|| 2 P^ for 
S g = 0). This establishes the information theoretic optimality. 

Appendix B 

Lemma 10 (An inequality): For any TV-dimensional vector 
p and N x N symmetric positive definite matrices A and B, 

(107) 



P T (A 



B) 'p 



> 



p T A 1 p 



p T B L p 



Proof: Since A,B e S++, A ? BA ? € S++. Define 
by U and A the following eigendecomposition A^ 11 BA ^ = 
UAU T . Hence A„ > 0,Vn. Define q 

q T q = p T A x p 



U T A 2 h. Note that 



q T A 1 q — p T B p, and 
q T (I + A)- 1 ^ = p T (A + By 1 p. 



(108) 



Hence, to prove ( |107| i, it suffices to show that 
1 1 1 



Z-<n=l 1+A n L-m=\ Z-<n=l A 

or equivalently, with a„ = and b n = 1 1^ A " 



JV TV AT 

XI 9 « X lla n b n > 

n—l n=l n—1 



N 
71=1 



(109) 



Since A„ > 0,Vn, both a n and 6„ are decreasing functions 
of A„. Hence inequality ( |109[ > follows from the Chebyshev's 
(sum) inequality (page 240, Equation 1.4, [33)). Equality holds 
if and only if, for all indices k for which qt 7^ (denote such a 
set by ixnz(q)), the eigenvalues are similar. That is, iff A& = A, 
Vfe G ixnz(q). ■ 

Appendix C 
Proof of Proposition!!] 



We would use Equation ( |44a| i. To start with, we note that 
Gh is a multiple of 1 and both matrices fijo and J~2p has 
an eigenvector as 1, where 1 has dimension L = MK. In 
particular, careful inspection of ( |3"3"j ) (the elements of matrices 
Eg and E x take two distinct values, diagonal and otherwise) 
yields 

Gh = g a h 0y /a g a h l, 

n P l = a\ (1 + (K - l)a x ) 1, and 

n JD l= [4gl{l + (K-l)(a x + ag) 

+ (MK -2K + l)a x a g } - r] 2 g 2 h 2 a g a h MK] 1, 



where a 2 = a 2 (l + 7), a x = and 7 = 9_§a. Based 

on the above equations, define scalars 0, /i, ^ be such that 
Ghh T G T l = 01, O p 1 = /il and f2 JD l = (f - r/ 2 ^)l, in 
particular, 

= g 2 hla g a h MK, 

fi = c 2 (1 + (K - l)a x ) , and 

^ = <t x 2 9o {l + (i^ - l)(ax + a g ) + (MK - 2K + l)a x a g } , 

(HI) 

From d44ab, we therefore obtain 



Jo P x(P) = 



v + -p — ?y 2 </>' 



(112) 



which when simplified further leads to ( |55| >. Since 1 is the 
corresponding eigenvector, we also have w op i oc 1, i.e., the 
sensors just average all the observations. 

Appendix D 
Proof of Proposition!!] 

Our goal is to show that for any feasible J < J op t, X R (J) 
contains a rank-1 matrix. Specifically, we will show that 



X = arg max Tr ITip-XI 

xe* R (j) 



(113) 



which is the global optimizer to the (convex) semi-definite 
optimization problem 



minimize Tr [i~2pXl 
x 

subject to Tr [( jri JD - fJ JN ) X] + J£ 2 < 0, 
Tr {n P , m X]<pC,m=l,,..,M, 
-x ■< 0, 

is rank-1. The Lagrangian of \\ \A\ is given by, 
C(X,a,/3,Z) =Tr [il P X] 

+ a(Tr [(m JD -n M )x] + Ji 2 ) 

M 



(114) 



£ (i m (Tr [n Pjm X] - P£) 



(115) 



m— 1 



-Tr [XZ] 
with the dual problem being, 



M 



maximize a J^ 2 — (3 m P% 

a, /3 ^ — ' 



subject to Z = rip + a ( JJIjd 

a > 0, /3 > 0. 



71/ 



JNJ 



^ 0, 



(116) 



Also, we have the following complementary conditions (let 
tilde denote respective values at optimality), 



a \ Ti 

An(Tr 



( Jf2jD — Ojn) X 



$7 



J?) = 0, 

0, m = 1. . 



Tr XZ =0. 



(117a) 
,M, (117b) 
(H7c) 
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Without loss of generality, let X = YY (such a decompo- 
sition is possible since X is symmetric positive semidefinite). 
Also denote the columns of Y as wi € M. for I = 1,2, ... ,L, 
so that Y = [wi, w 2 , . . • , Wl\. From (1 17c) , we have 

L 



Tr 



XZ 



E 

1=1 



<-p — 

Wi Zwi 



o, 



(118) 



which coupled with the fact that Z y implies that 

Zwi = 0, for alii = (119) 



Thus, from definition of Z in ( |1 16[ ), we conclude that wi-s 
are the generalized eigenvectors that satisfy 



(n + a („ 



$7 + 5 ( Jfljo - tt ) J w l = L , VZ, with 

M 

fi = Hp + 2J ^m^P.m, 
m— 1 

i = Gft. (note ri JN in ((33])). 
Hence it follows that for all Z, 

ft + SJfljo) - att T ) wi = L , 



(120) 



n + ajn JD ) att 1 )wt=0 L 



(121) 

=*> oc (^ft + 5 JO JD J t, (122) 

where ( |121| i is because ftp is positive definite and hence the 
matrix ( SI + aJttjoj is also positive definite and invertible, 

and ( |122| i follows from the fact that both a and t T Wi has to 
be non-zero to satisfy (120) . We thus conclude that 

wi is unique upto its norm, V/, 

=>• X is a rank-1 matrix, 

thereby establishing Proposition [5] 



Appendix E 
Proof of Proposition^] 



Problem (77] ) is equivalent to 



maximize Jw = 
w 



(g T wh) 



subject to 



Tr 

WE X W T 



E g WE x W q 



(124) 



<P^,m = l,...,M, 



in the sense that J\y and J'w are monotonically related 
through Jw = 1 _f 1 2 / j w an d W^opt is the same for both 
problems. This is further equivalent to 



maximize Jy 
v 



subject to \\v 7 



Tr 



(g T Vh x 

EnVV 2 



(125) 



by defining V, v m and h x such that 



V = WE} = 



, and h x = E x 2 h. 



(126) 



With a goal to reduce the number of optimization variables 
from MN to M, we define the matrix transformation V — > 
V x as one that retains the norm of its individual row vectors 
but otherwise aligns the rows to h x , i.e., 

V x = t-rr-^-rr, t rn = \\v m \\ , SO that 
"■X 



<7 - \\h II 2 ( gT *) 



(127) 



We would need the following result to proceed further. 
Lemma 11: When S g is diagonal, 

Jv < Jv x , for any V. (128) 

Proof: To prove Lemma (IT) , we will show that 



IM 2 



Tr 



||g T v|| 



£2~t T E g t + e \\h x \\ 



(129) 

where (a) follows from definition of Jy in (77] ) and Cauchy- 
Schwartz inequality implying (g T Vft. x ) < ||<jr T V"|| 2 ||/i x || 2 , 
and the last equality is due to definition of Jy % in (127) . Hence 
it remains to prove (b), which can be established by showing 



Tr 



(t T £g*) gg T - (g T t) S g J VV T \ < 0, (130a) 
and e (\\9 T V\\ 2 - (g T t) 2 ) < 0. (130b) 
Define g, V and its aligned equivalent V x as 



g = -Z g ig, so that 



Fx =* " 



h x \ 



where t m = ||w m || , or 



(131) 



t = Sgi (since S g is diagonal). 



Note that Tr 



VV 



\t\\ 2 . Condition (130a) is therefore 



equivalent to snowing 



Tr 

or equivalently 



ft\\ 2 gg T - (g T i) 2 )vv 



<o, 



g'V - (g't) <0, (132) 



which is similar to condition (130b) . Thus it remains to 
establish (130b) , which is true because 



9 T V 



M 



/] 9mV r 



(b) 

< 



M 



22, 9m 11^™" 



(c) / T.\ 2 



(133) 



where (a) and (c) are due to definitions of v m and t re- 
spectively, and (b) is due to Cauchy-Schwartz inequality, 
v m w n \\ v m\\ \\ v n\\ f° r a U 1 S\ ni,n < M. This 

completes the proof. ■ 
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Note that Jv x is a function of the vector t, whose elements 
are non-negative (since t m is a norm). Therefore problem 



125), in conjunction with Lemma 11 is equivalent to 



J° = v 2 -rf<j>M therefore the denominator term of ( |3"0) is 
rj 2 — Do = ^-^) the collaboration gain can be simplified as 



maximize Jy % (t) , 

subject to t 2 m < P^, m=l, 



(134) 



CG 



Mm— 1^2 I p Mui—Hs 

Mm ' i Mm 













P( "2 J 







(140) 



through t he rel ations V opt = *optp^y and Jv op , = Jv* (*opt) 
Problem ( |134| i is further equivalent to 



(9 * 

maximize = -=h , 

2 C 

subject to t m < P m , m = 1, . 



(135) 



,M, 



Each of the fragments can be simplified further, M ^^ 2 = 
i(M - 1)(1 - a x ), = g ( M - 1)(1 - a g )(l - 

«x). t = g g(l + (M-l)a g ) ^ ^7 = SO 2 (1 + ( M - 1)«0<*)- 

Replacing these fragments in ( |140| >, defining P g = -P^o an( l 
dividing both numerator and denominator by (1 + P g ) leads 
to Equation (|85) (with k replaced by M). 



through the following relations between variables Jv x (t) = 
|| fa x || 2 Y+pr, which proves Proposition 



Appendix F 
Proof of Lemma[8] 



We start by noting that for p€ (0, 1) 

M \ 2 

Q 2 \ l-i? 



For the individual-constraint case, we would use Examples 
5 (distributed) and 6 (connected) to compute the collaboration 
gain. First we show that all the constraints are active for 
both the distributed and connected cases if condition ([84} is 
satisfied. If we apply Proposition [6] to a homogeneous problem 
with ai = ■ ■ ■ = aM and b\ = ■ ■ ■ = &m> the active constraint 
condition ^m-i^m > 1 simplifies to 



k(q) 



1 



1 — Q2 



1-g 



M 



1 - g 2 1 + g2 
I + g 2 1 — g2 



(136) 



T M-1 2 

M-l 
m=l ' 



We would show that > for g G (0, 1). From ( [136) , 



(141) 







Mg- 



Q 



M-i 



d«(g) 

(l + ^) 2 (l-^ 
the numerator of which can be rearranged as 

e -^{i-e M )-M Q ^{i- e ) 

~2 



Mg- 



(137) 



= e -*(i-e)x; 



(138) 



m— 1 



For the distributed case (Example 5), we refer to problem 

( |75) to find that b m = a 2 g 2 (l — a g a x ) and c m = V^"* \ 
so that inequality ( |141) explicitly evaluates to condition (84) . 
For the connected case (specialized version of Example 6 for 
homogeneous parameters), we refer to Equation ( |78) to note 

'P^, so that inequality 



(142) 



which is clearly true if condition ( |84) holds (since a x € [0, 1]). 





that 6 TO = 3g(l — ctg) and 






(|141) evaluates to 




M-l \ 








jL-^m— 1 V 


ApC 



which is evidently a positive quantity. This completes the 
proof. 

Appendix G 
Proof of Proposition^ 

We start with the cumulative-constraint case, for which we 
will use the results from Example 3. From Equation ( |1 12) and 
corresponding to the distributed (K = 1) and connected cases 
(K = M), we denote the constants {/i, v} with subscripts 1, 2, 
as {/ii,^i} and {1^2,^2} respectively, i.e., 



As regards collaboration gain, we proceed as we did 
in the cumulative case. For the distributed (Equation ( f76) ) 
and connected (Equation ( |78) with homogeneous parameters) 
cases, we can rearrange the terms of J pt(P) to express 
them in the form of ( |139) , where the various constants 
are now <fi — 5o^o a g a h K ' v \ = G \do (l + ( K — l)o: g a x ), 
/ii = a 2 , v 2 = a 2 g 2 (l + (k - l)a g ) (1 + (M - l)a x ) and 
P"i = a x (1 + (M — l)a x )- However, in this case J is 
obtained not just by letting — > 00 (thereby letting /1 
vanish), but also by setting k = M in both <fi and v, i.e., 



(k=M)M 



v 2 (K,=M)-if<j)(K,=M)M 



. Therefore the denominator term 



V\ + # - T] 2 (j) 



and 



of m is V 2 - D = = which 



4>M 



(139) 



u 2 (k,=M) ~ v 2 K (l + (M-l)a g )' 

is just a scaled version of Adjusting ( |140) for this scaling 
and rest of the derivation remaining similar, we obtain 



V2 



where <j) = g 2 hla g a^Al, v\ = u\g\ (l + (M — l)a g a x ), 
Mi = oj, = ^xfo (! + (M - l)a g ) (1 + (M - l)a x ) 
and = c x (1 + (A/ — l)a x ). Applying the corresponding 

distortion terms (note D — ( \ + J ) ) in (|30), (note that 



CG 



K (l + (M-l)q g ) / Mm-^2 , p M^i-i. 
M (l+(«-l)a g ) ^ iWjiii ^ Mfii 













P 5 







, (143) 



which is precisely Equation ( |85) , thereby completing the 
proof. 
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TABLE I 

Main results: cumulative power constraint (examples 1, 2 and 3) 



Cases 



Conditions 



Optimal (equivalent) 
Fisher Information 



Optimal weights are 
proportional to 



(A) 
General 



J = h T G T | JD 



Gh 



w cx I S~2. 



'JD 



~P~e 



Gh 



(B) 
Perfect CSI 



./ = /»''( E + y- ) h 



w cx f2p X Gr P E 



ft 



(C) 

Perfect OGI, 
Perfect CSI, 
El 
(D) 
Distributed, 
Uncorrelated, 
Perfect OGI, 
Perfect CSI, 
(1 



S h = 0, 
£ n = 



A = I, 

£ is diagonal 
S h =0, 
S Q = 



•/ = /! ' I £ + ] / 1 



N 



J = J2 



n=l ff » 



2 A y A 



l+7n 

V 2 hl 



w cx f2p X Gr P ( £ + I /<■ 



W cx diag ([ui,«2, • ■ -,vn]) , 

-i 



1 



l+7» 



(E) 
Connected 



A = 11 J 



J = J 



J = h T E h 



l + r?J 



9 T (E 



W ocuv , u = + — | ,g. 
u = E~ h 



(F) 

Cycle topology, 
Homogeneous, 
Equicorrelated- 

(E, Eh, Eg) 



A 
h 

E h 
E 



C(K), 

hl(l - ah) I, 
a 2 R(p), 

goi 1 -^) 1 



j = 



PN + Oth [pN + 



1 

— I a n 

N 



N 



3-) {l + PK + ah (pk + %) 



Pt = P H 7—, 7 = — o-, a = !• 

r a A a 



w cx 1^. 



Note: K = 1 =S> Distributed, K = N ^> Connected 
a g = 1 =*> Perfect CSI, a h = 1 Perfect OGI 



Other definitions: T P = [G T Slp 1 G\ , E = E + r; 2 E h , R{p) = ((1 - p)I + pll T ) 



Appendix H 
Summary of main results (Tables |T1and|iT|) 
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TABLE II 

Main results: individual power constraint (Examples 4, 5 and 6) 



Cases 



Conditions 



Optimal (equivalent) 
Fisher Information 



Optimal weights 



(A) 
Distributed, 
Uncorrelated, 
Perfect OGI, 
Perfect CSI, 
HI 



A = I, 

X is diagonal 
S h = 0, 
S a =0 



J =F pi (a, 6, c) 



■m m ' ^ m 



= \n 



PS. 



W = diag (t opt ) 



(B) 
Distributed, 
Homogeneous, 
Equicorrelated- 

(X, Eft, E„) 



(C) 
Connected, 
Uncorrelated- 
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